Unitail: Detecting, Reading, and Matching in Retail Scene

Authors

Abstract

To make full use of computer vision technology in stores, it is required to consider the actual needs that fit the characteristics of the retail scene. Pursuing this goal, we introduce the United Retail Datasets (Unitail), a large-scale benchmark of basic visual tasks on products that challenges algorithms for detecting, reading, and matching. With 1.8M quadrilateral-shaped instances annotated, Unitail offers a detection dataset that aligns with product appearance better. Furthermore, it provides a gallery-style OCR dataset containing 1454 product categories, 30k text regions, and 21k transcriptions to enable robust reading on products and motivate enhanced product matching. Besides benchmarking the datasets using various state-of-the-art methods, we customize a new detector for product detection and provide a simple OCR-based matching solution that verifies its effectiveness. The evaluation server is publicly available at https://unitedretail.github.io.

Journal

Journal title: Lecture Notes in Computer Science

Year: 2022

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-20071-7_41